# Document Image Classification
Dit Base Finetuned Rvlcdip Finetuned Data200
This model is a fine-tuned version of microsoft/dit-base-finetuned-rvlcdip on an image folder dataset, primarily used for image classification tasks.
Image Classification
Transformers

D
AthiraVr
16
0
Finetuned Vit Image Text Classifier
Apache-2.0
An image classification model based on the ViT architecture, designed to identify whether an image contains text and the type of text (Latin, Chinese, Arabic)
Image Classification
Transformers

F
ernie-ai
45
0
Dit Base Finetuned Brs
An image classification model fine-tuned based on microsoft/dit-base, performing well on the image folder dataset
Image Classification
Transformers

D
sergiocannata
13
0
Donut Base Finetuned Rvlcdip
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder to process document images.
Image-to-Text
Transformers

D
naver-clova-ix
125.36k
13
Dit Large Finetuned Rvlcdip
Document image classification model pretrained on IIT-CDIP and fine-tuned on RVL-CDIP, using Transformer architecture
Image Classification
Transformers

D
microsoft
67
8
Dit Base Finetuned Rvlcdip
DiT is a Transformer-based document image classification model, pretrained on the IIT-CDIP dataset and fine-tuned on the RVL-CDIP dataset
Image Classification
Transformers

D
microsoft
31.99k
30
Featured Recommended AI Models